Multi-Word Verbs In A Flective Language: The Case Of Estonian

نویسندگان

  • Heiki-Jaan Kaalep
  • Kadri Muischnek
چکیده

This paper describes automatic treatment of multi-word expressions in a morphologically complex flective language – Estonian. It focuses on a special type of multi-word expressions – the verbal multi-word expressions that can function as predicates. Authors describe two language resources – a database of verbal multi-word expressions and a corpus where these items have been annotated manually. The analysis of the annotated corpus demonstrates that the Estonian verbal multi-word expressions alternate in several grammatical categories. Different types of the verbal multi-word expressions (opaque and transparent idioms, support verb constructions and collocations) behave differently in the corpus with regard to the freedom of alternation. The paper describes main types of these alternations and the methods for dealing with them automatically.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Developing a Semantic Similarity Judgment Test for Persian Action Verbs and Non-action Nouns in Patients With Brain Injury and Determining its Content Validity

Objective: Brain trauma evidences suggest that the two grammatical categories of noun and verb are processed in different regions of the brain due to differences in the complexity of grammatical and semantic information processing. Studies have shown that the verbs belonging to different semantic categories lead to neural activity in different areas of the brain, and action verb processing is r...

متن کامل

Using the Text Corpus to Create a Comprehensive List of Phrasal Verbs

The paper describes extraction of Estonian multi-word verbs from text corpora, using a languageand task-specific software tool SENVA, which is based on a statistical language-independent software tool SENTA (Dias et al, 2000). The outcome is a comprehensive list of 16,000 phrasal verbs. We describe the extraction tool, manual post-editing principles, and evaluate the outcome in terms of precisi...

متن کامل

Automatic WSD: Does it Make Sense of Estonian?

This paper describes a fully automatic Estonian word sense disambiguation system called semyhe which is based on Estonian WordNet (EstWN) hyponymjhypernym hierarchies and meant to disambiguate both nouns and verbs. 1 Short description of the system The main inspiration for our system is Agirre and Rigau (1996) similar system that disambiguates the English noun senses based on WordNet hyponymjhy...

متن کامل

How textbooks (and learners) get it wrong: A corpus study of modal auxiliary verbs

Many  elements  contribute  to  the  relative  difficulty  in  acquiring  specific  aspects  of  English  as  a foreign  language  (Goldschneider  &  DeKeyser,  2001).  Modal  auxiliary  verbs  (e.g.  could,  might), are  examples  of  a  structure  that  is  difficult  for  many  learners.  Not  only  are  they  particularly complex  semantically,  but  especially  in  the  Malaysian  context ...

متن کامل

Partitive Morphosemantics across Estonian Grammatical Categories, and Case Variation with Equi and Raising

This article discusses two related case marking phenomena in Estonian. The first issue is representing partitive morphosemantics as it is expressed in the language specific aspectual, epistemic modal and evidential grammatical categories. A DRT based sketch of propositional attitudes and speaker-hearer relationships explicates several semantic links between the categories. The second case marki...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006